首页> 外文OA文献 >A Fast General-Purpose Clustering Algorithm Based on FPGAs for High-Throughput Data Processing
【2h】

A Fast General-Purpose Clustering Algorithm Based on FPGAs for High-Throughput Data Processing

机译:一种基于FpGa的快速通用聚类算法   高吞吐量数据处理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present a fast general-purpose algorithm for high-throughput clustering ofdata "with a two dimensional organization". The algorithm is designed to beimplemented with FPGAs or custom electronics. The key feature is a processingtime that scales linearly with the amount of data to be processed. This meansthat clustering can be performed in pipeline with the readout, withoutsuffering from combinatorial delays due to looping multiple times through allthe data. This feature makes this algorithm especially well suited for problemswhere the data has high density, e.g. in the case of tracking devices workingunder high-luminosity condition such as those of LHC or Super-LHC. Thealgorithm is organized in two steps: the first step (core) clusters the data;the second step analyzes each cluster of data to extract the desiredinformation. The current algorithm is developed as a clustering device formodern high-energy physics pixel detectors. However, the algorithm has muchbroader field of applications. In fact, its core does not specifically rely onthe kind of data or detector it is working for, while the second step can andshould be tailored for a given application. Applications can thus be foreseento other detectors and other scientific fields ranging from HEP calorimeters tomedical imaging. An additional advantage of this two steps approach is that thetypical clustering related calculations (second step) are separated from thecombinatorial complications of clustering. This separation simplifies thedesign of the second step and it enables it to perform sophisticatedcalculations achieving online-quality in online applications. The algorithm isgeneral purpose in the sense that only minimal assumptions on the kind ofclustering to be performed are made.
机译:我们提出了一种快速通用算法,用于“具有二维组织”的数据的高吞吐量聚类。该算法旨在与FPGA或定制电子设备一起实现。关键特性是处理时间与要处理的数据量成线性比例。这意味着可以在具有读数的流水线中执行聚类,而不会因遍历所有数据多次而遭受组合延迟的困扰。此功能使该算法特别适用于数据密度高的问题,例如对于在高亮度条件下工作的跟踪设备(例如LHC或Super-LHC)。算法分为两个步骤:第一步(核心)对数据进行聚类;第二步分析每个数据聚类以提取所需的信息。当前算法被开发为用于现代高能物理像素检测器的聚类设备。然而,该算法具有更广阔的应用领域。实际上,它的核心并不特别依赖于它正在处理的数据或检测器的种类,而第二步可以并且应该针对给定的应用进行定制。因此可以预见到其他探测器和其他科学领域的应用,从HEP量热计到医学成像。这两个步骤的另一个优点是,典型的与聚类相关的计算(第二步)与聚类的组合复杂性分开了。这种分离简化了第二步的设计,并使它能够执行复杂的计算,从而在在线应用程序中实现在线质量。在仅对要执行的群集类型做出最小假设的意义上,该算法是通用的。

著录项

  • 作者

    Annovi, A.; Beretta, M.;

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号